Patch Similarity Aware Data-Free Quantization for Vision Transformers

Authors

Abstract

Vision transformers have recently gained great success on various computer vision tasks; nevertheless, their high model complexity makes it challenging to deploy them on resource-constrained devices. Quantization is an effective approach to reduce model complexity, and data-free quantization, which can address data privacy and security concerns during model deployment, has received widespread interest. Unfortunately, all existing methods, such as BN regularization, were designed for convolutional neural networks and cannot be applied to vision transformers, which have significantly different model architectures. In this paper, we propose PSAQ-ViT, a Patch Similarity Aware data-free Quantization framework for Vision Transformers, to enable the generation of "realistic" samples based on the vision transformer's unique properties for calibrating the quantization parameters. Specifically, we analyze the self-attention module's properties and reveal a general difference (patch similarity) in its processing of Gaussian noise and real images. The above insights guide us to design a relative value metric to optimize the Gaussian noise to approximate the real images, which are then utilized to calibrate the quantization parameters. Extensive experiments and ablation studies are conducted on various benchmarks to validate the effectiveness of PSAQ-ViT, which can even outperform the real-data-driven methods. Code is available at: https://github.com/zkkli/PSAQ-ViT.
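As a rough illustration of the patch-similarity idea mentioned in the abstract, the sketch below computes pairwise cosine similarity between patch feature vectors and summarizes how varied those similarities are. It is not taken from the PSAQ-ViT code: the function names, the histogram-entropy score, and the toy inputs are all assumptions made for illustration, and the abstract does not specify how the paper's relative value metric is actually defined.

```python
import numpy as np


def patch_cosine_similarity(features):
    """Pairwise cosine similarity between patch features (shape: patches x dim)."""
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.clip(norms, 1e-12, None)
    # Clip to guard against tiny floating-point overshoot outside [-1, 1].
    return np.clip(unit @ unit.T, -1.0, 1.0)


def similarity_entropy(sim, bins=32):
    """Hypothetical diversity score: Shannon entropy of the histogram of
    off-diagonal similarities (an assumed stand-in, not the paper's metric)."""
    n = sim.shape[0]
    off_diag = sim[~np.eye(n, dtype=bool)]
    counts, _ = np.histogram(off_diag, bins=bins, range=(-1.0, 1.0))
    probs = counts[counts > 0] / counts.sum()
    return float(-(probs * np.log(probs)).sum())


rng = np.random.default_rng(0)

# Toy input 1: 16 patches that are all nearly the same vector.
base = rng.normal(size=(1, 64))
homogeneous = base + 0.01 * rng.normal(size=(16, 64))

# Toy input 2: two distinct groups of 8 patches each.
group_a = rng.normal(size=(1, 64)) + 0.01 * rng.normal(size=(8, 64))
group_b = rng.normal(size=(1, 64)) + 0.01 * rng.normal(size=(8, 64))
mixed = np.vstack([group_a, group_b])

sim_homogeneous = patch_cosine_similarity(homogeneous)
sim_mixed = patch_cosine_similarity(mixed)

entropy_homogeneous = similarity_entropy(sim_homogeneous)
entropy_mixed = similarity_entropy(sim_mixed)

print("homogeneous patches:", entropy_homogeneous)
print("two patch groups:   ", entropy_mixed)
```

In this toy setup the near-identical patches produce similarities that all fall into one histogram bin, while the two-group input spreads them over more than one bin, so the second score is larger. A score of this kind could in principle serve as an objective when optimizing synthetic inputs, but how PSAQ-ViT does so is described in the paper and its repository, not here.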


Similar resources

Distributed query-aware quantization for high-dimensional similarity searches

The concept of similarity is used as the basis for many data exploration and data mining tasks. Nearest Neighbor (NN) queries identify the most similar items, or in terms of distance the closest points to a query point. Similarity is traditionally characterized using a distance function between multi-dimensional feature vectors. However, when the data is high-dimensional, traditional distance f...

Class-Aware Similarity Hashing for Data Classification

This paper introduces “class-aware similarity hashes” or “classprints,” which are an outgrowth of recent work on similarity hashing. The approach builds on the notion of context-based hashing to create a framework for identifying data types based on content and for building characteristic similarity hashes for individual data items that can be used for correlation. The principal benefits are th...

Interval Similarity-Based Quantization Method for Continuous Data

Data quantization methods for continuous attributes play an extremely important role in artificial intelligence, data mining and machine learning because discrete values of attributes are required in most classification methods. In this paper, we present an interval similarity-based quantization method for continuous data. It defines an interval similarity criterion which is regarded as a new m...

Bohr: Similarity Aware Geo-distributed Data Analytics

We propose Bohr, a similarity aware geo-distributed data analytics system that minimizes query completion time. The key idea is to exploit similarity between data in different data centers (DCs), and transfer similar data from the bottleneck DC to other sites with more WAN bandwidth. Though these sites have more input data to process, these data are more similar and can be more efficiently aggr...

Journal

Journal title: Lecture Notes in Computer Science

Year: 2022

ISSN: 1611-3349, 0302-9743

DOI: https://doi.org/10.1007/978-3-031-20083-0_10